Harmonicity and dynamics based audio separation
نویسندگان
چکیده
Audio signal source separation is an interesting task performed by humans. In this paper, we present a frequency grouping algorithm based on principles of harmonicity and dynamics: frequency components with a harmonic relation and similar dynamics belong to the same source. The grouping is demonstrated for a variety of sound mixtures.
منابع مشابه
A Segmental Spectral Flatness Measure for Harmonic-Percussive Discrimination
In a variety of applications of audio signal processing, for example blind source separation (BSS), harmonic signals need to be treated differently from percussive signals. Thus, recognizing if a signal is harmonic or percussive is a very helpful and frequently used preprocessing step. Different measures have been proposed to capture either the harmonicity or the percussivity of a signal, using...
متن کاملAdvances in audio source seperation and multisource audio content retrieval
Audio source separation aims to extract the signals of individual sound sources from a given recording. In this paper, we review three recent advances which improve the robustness of source separation in real-world challenging scenarios and enable its use for multisource content retrieval tasks, such as automatic speech recognition (ASR) or acoustic event detection (AED) in noisy environments. ...
متن کاملA Technique towards Automatic Audio Classification and Retrieval
Audio classification is very important in many audio applications so that different audio signal can be processed appropriately. We propose an audio classification scheme which will categorise audio based on a number of audio features. These features include silence ratio, spectral centroid, harmonicity and pitch. Our preliminary experiments with silence ratio feature produce very promising cla...
متن کاملMulti-speaker meeting audio segmentation
This paper presents segmentation of multi-speaker meeting audio into four different classes: local speech, crosstalk, overlapped speech and non-speech sounds. Firstly, Bayesian Information Criterion (BIC) segmentation method is used to pre-segment the meeting according to speaker changing points. Then, harmonicity information is integrated into acoustic features to differentiate speech from non...
متن کاملSpeech/laughter classification in meeting audio
In this paper, harmonicity information is incorporated into acoustic features to detect laughter segments and speech segments. We implement our system using HMM (Hidden Markov Models) classifier trained on Pitch and Harmonic Frequency Scale based subband filters (PHFS). Harmonicity of the signal can be determined by variation of the pitch and harmonics. The cascaded subband filters are used to ...
متن کامل